Measuring the energy landscape roughness and the transition state 

location of biomolecules using single molecule mechanical unfolding 

^ ! experiments 
o 

Q ■ Changbong Hyeon 1 & D. Thirumalai 1 ' 2 

1 Biophysics Program, 
Institute for Physical Science and Technology, 
Q ■ University of Maryland, 



CO 



o 
o 



> 
m 
m 

O 
-i— > 



X 



College Park, MD 20742 
2 Department of Chemistry and Biochemistry, 
University of Maryland, 
College Park, MD 20742 



(Dated: February 6, 2008) 

Single molecule mechanical unfolding experiments are beginning to provide profiles 
of the complex energy landscape of biomolecules. In order to obtain reliable estimates 



of the energy landscape characteristics it is necessary to combine the experimental 
measurements (the force extension curves, the mechanical unfolding trajectories, 
force or loading rate dependent unfolding rates) with sound theoretical models and 
simulations. Here, we show how by using temperature as a variable in mechanical 
unfolding of biomolecules in laser optical tweezer or AFM experiments the roughness 
of the energy landscape can be measured without making any assumptions about 
the underlying reaction coordinate. The efficacy of the formalism is illustrated by 
reviewing experimental results that have directly measured roughness in a protein- 
protein complex. The roughness model can also be used to interpret experiments 
on forced-unfolding of proteins in which temperature is varied. Estimates of other 
aspects of the energy landscape such as free energy barriers or the transition state 
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(TS) locations could depend on the precise model used to analyze the experimental 
data. We illustrate the inherent difficulties in obtaining the transition state location 
from loading rate or force-dependent unfolding rates. Because the transition state 
moves as the force or the loading rate is varied it is in general difficult to invert the 
experimental data unless the curvature at the top of the one dimensional free energy 
profile is large, i.e the barrier is sharp. The independence of the TS location on force 
holds good only for brittle or hard biomolecules whereas the TS location changes 
considerably if the molecule is soft or plastic. We also comment on the usefulness of 
extension of the molecule as a surrogate reaction coordinate especially in the context 
of force-quench refolding of proteins and RNA. 



INTRODUCTION 

Representation of the large conformational space of RNA and proteins in terms of (low- 
dimensional) energy landscape has played an important role in visualizing their folding routes 
. It is suspected that the energy landscape of many evolved proteins is relatively smooth 
so that they can be navigated very efficiently. By smooth we mean that the gradient of the 
energy landscape AF(x,Rg) towards the native basin of attraction (NBA) is "large" enough 
that the biomoiecule does not get kinetically trapped in competing basins of attraction for 
long times during the folding process. Here, F(x, R g ) is expressed in terms two non-unique 
variables, namely, the radius of gyration R g and Xi ( or equivalently Q the fraction of native 
contacts), an order parameter that measures how similar a given conformation is to the native 
state. However, perfectly smooth with energy landscapes are difficult to realize because of 
energetic and topological frustration 4], |5j- In proteins, the hydrophobic residues prefer to be 
sequestered in the interior while polar and charged residues are better accommodated on the 
surfaces where they can interact with water. Often these conflicting requirements cannot be 
simultaneously satisfied and hence proteins can be energetically "frustrated". It is clear from 
this description that only evolved or well designed sequences can minimize energetic frustration. 
Even if a particular foldable sequence minimizes energetic conflicts it is nearly impossible to 



eliminate topological frustration which arises due to chain connectivity 



7|. If the packing of 
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locally formed structures is in conflict with the global fold then the polypeptide or polynucleotide 
chain is topologically frustrated. Both sources of frustration, energetic and topological, render 
the energy landscape rugged on length scales that are larger than those in which secondary 
structures (~ (1 — 2) nm) form even if folding can be globally described using only two-states 
(folded and unfolded). 

An immediate consequence of frustration is that the free energy, projected along a one di- 
mensional coordinate, is rough on certain length scale and may be globally smooth on larger 
scale. Let us assume that the characteristic roughness e has a Gaussian distribution. Following 
footnote 45 in |8J the overall transit time from the unfolded basin may be written as 



r09) / dee^e- 1 ' 2 ^ 2 « r(/3)e^ 2 / 2 
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where e is the average value of ruggedness. The second part of the equation holds good at 
low temperatures. The additional slowing down in the folding time Ty^p arising from the 
second term in EqJTJ was derived in an elegant paper by Zwanzig [9] and was also obtained 
in IPJ by analyzing the dynamics of Derrida's random energy model If fie is small then 
tu->f ~ T e /3AF * where AF^ is the overall folding free energy barrier. 

If folding takes place in a rough energy landscape then the characteristic scale-dependent 
time scale may be estimated as 

t ss « (10 - 100)ns |«(l-2)nm 

(2) 

Tu_ F I « L 

where L is the effective contour length of the biomolecule. We have assumed that structures on 
I m (1 — 2) nm form in Tss ~ I 2 / D where the diffusion constant is on the order of (10 -6 — 10~ 7 ) 
cm 2 / sec. The estimate of Tss is not inconsistent with the time needed to form a-helices or 
/3-hairpin especially given the crude physical picture. 

With the possibility of manipulating biological molecules (FigHJ), one molecule at a time, 
using force it is becoming possible to probe the features of their energy landscape (such as 
roughness and the transition state location) that are not easily possible using conventional 
experiments. Such experiments, performed using Laser Optical Tweezers (LOTs) {12I . 
Atomic Force Microscopy (AFM 
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2l|, RNA 



or 
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J4 k h a ve made it possible to mechanically unfo d proteins 
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25 



26|, 



23, 



281 ] . and their complexes 
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|30j, [31|, 13J 
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34| , or initiate refolding of proteins 35j and RNA [26j, |27| • These remarkable experiments 



show how the initial conditions affect refolding and also enable us to examine the response of 



biological molecules over a range of forces and loading rates. In addition^ 



of statistical mechanics, including non-equilibrium work theorems 36|, |37j, can be rigorously 



undamental aspects 



tested using the single molecule experiments 



38 



39(. Here, we are concerned with using the 



data and theoretical models to extract key characteristics of the energy landscape of biological 
systems. 

The crude physical picture of folding in a rough energy landscape (Figj2J) is not meaningful 
unless the ideas can be validated experimentally which requires direct measurement of the 
roughness energy scale e, barrier height, etc. In conventional experiments, in which folding is 
triggered by temperature, it is difficult to measure tq and AF$ even when f3e = [40|. We 
proposed, using theoretical methods, that (3e can be directly measured using forced-unfolding 



4l| . which 



of biomolecules and biomolecular complexes. The Hyeon-Thirumalai (HT) theory 
was based on Zwanzig's treatment 9f, showed that if unbinding or unfolding lifetime (or rates) 
are known as a function of the stretching force (/) and temperature (T) then e can be inferred 
without explicit knowledge of r or AF*. Recently, the loading-rate dependent unbinding times 
of a protein-protein complex using atomic force microscopy (AFM) at various temperatures have 



been used to obtain an estimate of e 



42J. Similarly, variations in the forced-unfolding rates as a 



unction of temperature of Dictyostelium discoideum filamin (ddFLN4) were used to estimate e 



43j . Although alternate interpretation of the data is proposed for temperature effect on ddFLN4 



the variation in unbinding or unfolding rates of proteins as a function of / and T provides an 
opportunity to obtain quantitative estimates of the energy landscape characteristics. 

Single molecule force spectroscopy can also be used to measure force-dependent unfolding 
rates from which the location of the transition state (TS) in terms of the spatial extension (R) 
can be computed. This procedure is a not straightforward because, as shown in a number of 



studies 
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45 



46 



47|, 



481 ]. the location of the transition state changes as / changes unless the 



curvature of the free energy profile at the TS location is large, i.e, the barrier is sharp. The 
extent to which the TS changes depends on the load. We propose distinct scenarios for variation 
of the TS location as the external conditions change. By carefully considering the variations of 



force distributions it is possible to obtain reliable estimates of the TS location 



431 ] . Here, we 



5 



review recent developments in single molecule force spectro scop y that have attempted to obtain 



41 



48 



49 



50 



51 



52 



53 



54J. Using 



the energy landscape characteristics of biological molecules 
theoretical models we also point out some of the ambiguities in interpreting the experimental 
data from dynamic force spectroscopy. 



THEORETICAL BACKGROUND 



Single molecule mechanical unfolding experiments differ from conventional unfolding exper- 
iments in which unfolding (or folding) is triggered by varying temperature or concentration of 
denaturants or ions. In single molecule experiments folding or unfolding can be initiated by 
precisely manipulating the initial conditions. In both forced-unfolding and force-quench refold- 
ing the initial conformation, characterized by the extension of the biomolecules, are precisely 
known. On the other hand, the nature of the unfolded states, from which refolding is initiated, 
is hard to describe in ensemble experiments. For RNA and proteins, whose energy landscape is 
complex [2I, , details of the folding pathways can be directly monitored by probing the time 
dependent changes in the end-to-end distance R(t) of individual molecules. Analysis of such 
mechanical folding and unfolding trajectories allows to explore regions of the energy landscape 
that are difficult to probe using ensemble experiments. 

Force experiments can measure the extension of the molecule as a function of time. There are 
three modes in which the stretching experiments are performed. Most of the initial experiments 



were performed by unfolding biomolecules (especially proteins) by pu 



at constant velocity while keeping the other end fixed 16 



22 



23 



56 



l ing on one of the molecule 



571 ] . More recently, it has 



Decome possib 
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35 
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e to apply constant force on the molecule of interest using feed-back mechanism 



60] . In addition, force-quench experiments have been reported in which the 



forces are decreased or increased linearly 
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35]. It is hoped that a combination of such 



experiments can provide a detailed picture of the complex energy landscape of proteins and 
RNA. In all the modes, the variable conjugate to / is the natural coordinate that describes the 
progress of the reaction of interest (folding, unbinding or catalysis). If there is a energy barrier 
confining the molecular motion to a local minimum, whose height is greater than fc^T, then 
a sudden increase (decrease) of extension (force) signifies the transition of the molecule over 



6 



the barrier. A cusp in the force-extension curve (FEC) is the signature of such a transition. 



Surprisingly, for proteins and RNA it has been found that t 



using the semi-flexible or worm-like chain model 16 



22 



re FECs can be quantitatively fit 
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57 



61 



621 ] . From such fits, the 



global polymeric properties of the biomolecule, such as the contour length and the persistence 



length can be extracted 
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Single molecule experiments provide distributions of the unfolding times (or unfolding force) 
by varying external conditions. The objective is to construct the underlying energy landscape 
from such measurements and from mechanical folding or unfolding trajectories. However, it 
is difficult to construct from FEC or mechanical folding trajectories that report only changes 
at two points all the features of the energy landscape of biomolecules. For example, although 
signature of roughness in the energy landscape may be reflected as fluctuations in the dynamical 
trajectory it is difficult to estimate its value unless multiple pulling experiments are performed. 
We had proposed that the power of single molecules can be more fully realized if temperature 
(T) is also used as an additional variable 
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45 



63 



64|. By using T and / it is possible to 



obtain the phase diagram as a function of T and / that can be used to probe the nature of 
collapsed molten globules which are invariably populated but are hard to detect in conventional 
experiments. We also showed theoretically that the roughness energy scale (e) can be measured 



41] if both / and temperature (T) are varied in single molecule experiments. The effect of 



e manifests itself in the 1/T 2 dependence of the rates of force-induced unbinding or unfolding 
kinetics. In the following subsections we first review the theoretical framework to describe the 
force-induced unfolding kinetics and show how the 1/T 2 dependence emerges when the roughness 
is treated as a perturbation in the underlying energy profile. 

Bell model: Historically, a phenomenological description of the forced-unbinding of adhesive 



contacts was made by Bell 



491 ] long before single molecu 



context of ligand unbinding from binding pocket, Bell 



e experiments were performed. In the 



49] conjectured that the kinetics of bond 



rupture can be described using a modified Eyring rate theory 651 ] . 



k = r } BT c - (E% ' lf),kBT 
h 



(3) 



where fee is the Boltzmann constant, T is the temperature, h is the Planck constant, and k is the 
transmission coefficient. In the Bell description the activation barrier E* is reduced by a factor 
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7 x / when the bond or the biomolecular complex is subject to external force /. The parameter 
7 is a characteristic length of the system under load and specifies the distance at which the 
molecule unfolds or the ligand unbinds. The prefactor ^f- is the vibrational frequency of a 
single bond. The Bell model shows that the unbinding rates increase when tension is applied 
to the molecule. Although Bell's key conjecture, i.e., the reduction of activation barrier due to 
external force, is physically justified, the assumption that 7 does not depend on the load is in 
general not valid. In addition, because of the multidimensional nature of the energy landscape 
of biomolecules, there are multiple unfolding pathways which require modification of the Bell 
description of forced-unbinding. It is an oversimplification to restrict the molecular response to 
the force merely to a reduction in the free energy barrier. Nevertheless, in the experimentally 
accessible range of loads the Bell model in conjunction with Kramers' theory of escape from a 
potential well have been remarkably successful in fitting much of the data on forced-unfolding 
of biological molecules. 

Mean first passage times: In order to go beyond the popular Bell model many attempts have 
been made to describe unbinding process as an escape from a free energy surface in the presence 
of force 4jJ, |5(J, [52|, |66j, |67j]. This is traditionally achieved by a formal procedure that adapts the 
Liouville equation that describes the time evolution of the probability density representing the 
molecular configuration on the phase space. 

For the problem at hand, one can project the entire dynamics onto a single reaction coordi- 
nate provided the relaxation times of other degrees of freedom are shorter than the time scale 
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69| | . In applications to force- 



associated with the presumed order parameter of interest 
spectroscopy, we assume that the the variable conjugate to / is a reasonable approximation 
to the reaction coordinate. The probability density of the molecular configuration, p(x,t\xo), 
whose configuration is represented by order parameter x at time t, obeys the Fokker Planck 
equation. 

dp(x,t\xn) , s „ » d . / d 1 dF(x)\ , . . 

- £ fp (x)p(x, t\x ) = ^-D{x) — + — — ~t~~ P(x, t\x ), (4) 



dt dx \dx ksT dx 

where D(x) is the position-dependent diffusion coefficient, and F(x) is an effective one- 
dimensional free energy, xq is the position at time t = 0. If the initial distribution is given 



8 



by p(x ,t = 0\x ) = 5(x — x Q ) the formal solution of the above equation reads p(x, t\x ) = 
e FP 5(x — xq). If we use absorbing boundary condition at a suitably defined location, the 
probability that the molecule remains bound (survival probability) at time t is 



S(xo,t) = J dxp(x 1 t\xo) = J dxe fp 5(x — xq). (5) 

In terms of the the first passage time distribution, PFp(x ,t)(= —dS(x ,t)/dt), the mean first 
passage time can be computed using, 

POO 

t(x ) = / dt[tp FP (x ,t)] 



POO 

/ dt 


' dS(x ,t)~ 


lo 


dt 



dt J dxe tCpp{x) 5{x - Xq) 

dt [ dx5(x - x )e tcpp(x) (6) 



where C^ FP is the adjoint operator. In obtaining the above equation we used S(xo,t = oo) = 
and integrated by parts in going from the second to third third line. By operating on both sides 
of EqJHlwith C* fp (xq) and exchanging the variable x with x Q we obtain 

C fp {x)t{x) = e F ^/ kBT ^-D(x)e- F ^/ kBT ^-r(x) = -1. (7) 

The rate process with reflecting boundary d x r{a) = and absorbing boundary condition r(b) = 
in the interval a < x < b, leads to the expression of mean first passage time, 

r( x )= f dye F ^' hBT -^- P ' dze~ F ^l kpJ \ (8) 
Jx D(y) J a 

Diffusion in a rough potential: In the above analysis the one-dimensional free energy profile 
F(x) that approximately describes the unfolding or unbinding event is arbitrary. In order to 
explicitly examine the role of the energy landscape ruggedness we follow Zwanzig and decompose 
F(x) into F(x) = F (x) + Fi(x) [9|. where F (x) is a smooth potential that determines the global 
shape of the energy landscape, and Fi(x) is the periodic ruggedness that superimposes F 1 (x). 
By taking the spatial average over Fi(x) using (e ±l3Fl ^}i = j J ' dxe ±/3Fl< ^ x \ where I is the 
ruggedness length scale, the associated mean first passage time can be written in terms of the 
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effective diffusion constant D*(x) as, 



D*(x) 



T[X 



D(x) 



(eP F ^))i(e-P F ^))i' 



■F Q (z)/k B T 



(9) 



An inversion of roughness barrier, i.e., F\ <-> —F\ does not alter D*(x). In the presence of 
roughness D*(x) < D(x). Depending on the distribution of roughness barrier, D*(x) can take 
various forms: 

1. For Fi(x) = ex/b (0 < x < b) and Fi(x) = e(a — x)/(a — b) (b < x < a) with Fi(x) = 



F^x + a), ( e ^i(*))( e -^W) 



sinh/3e/2 
(3e/2 



n 2 



70j. 



2. For Fx (a;) = ecosga;, ( e WW)( e -mW) = [J (/5e)] 2 |9j. 

3. If P(Fi) is Gaussian with variance (F?) = e 2 , ( e ^W)( e -WW) = e /^ 2 |g|. 

4. If P(F0 = P(-Fi), then (e^W), = J dF 1 P(F 1 ) [l + ±f3 2 e 2 + i/5 4 e 4 + •••]. 

In all cases when /3e is small the effective diffusion coefficient can be approximated as Dq « 
Z)exp (— /3 2 e 2 ) where Do is the bare diffusion constant. If P(Fi) is a Gaussian then this expres- 
sion is exact. 

From the recent experimental analysis on mechanical unfolding kinetics of the multi-ubiquitin 

r~i 

construct, Fernandez and coworkers [60( obtained a power-law distribution of unfolding times 
i.e., P(t) oc T~^ 1+a \ and showed that the distribution of energy barrier heights should be 
P(Fi) ~ exp(— \Fi\/Fi) where F 1 = ksTja [60|. This case belongs to class 4, thus one obtains 
e - e2 / k B T2 behavior associated with the effective diffusion coefficient. These examples show that 
the coefficient associated with 1/T 2 behavior is due to the energy landscape roughness provided 
the extension is a good reaction coordinate. The dependence of e~ e2 ^ k s T in D* suggests that 
the diffusion in rough potential can be substantially slowed even when the scale of roughness is 
not too large. 

Barrier crossing dynamics in a tilted potential: In writing the one-dimensional Fokker-Planck 
equation (EqfjJ we assumed that the order parameter a; is a slowly changing variable. This 
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assumption is valid if the molecular extension, in the presence of /, describes accurately the 
conformational changes in the biomolecule. 

Following the Bell's conjecture we can replace F(x) by F(x) — f ■ x in which / "tilts" the 
free energy surface. Thus, in the presence of mechanical force Eqj9] becomes 



k- 1 (f) = r(f;x) 



dye 



(Fo(y)-fy)/k B T_ 



1 



dze -(F (z)-fz)/k B T _ 



(10) 



D*(y) J a 

As long as the energy barrier is large enough (see Figj2]) Eq{10] can be further simplified using 
the saddle point approximation. The Taylor expansions of the free energy potential Fq(x) — fx 
at the barrier top and the minimum result in the Kramers' equation 7_l|, |72|] , 

2vr/c B T 



k-\f) = r(f) 



D*muj b (f)u ts (f) 



o 0(AF*{f)-fAx(f)) 



2ttC 



y^Wu^Wl] e /3(AF*(/)-/Ax(/)) 



u b (f)uj ts (f) 

where u b and u ts are the curvatures of the potential, \dlF (x)\, at x = x b and x ts , respectively, 
the free energy barrier AFq = F Q (x ts ) — F Q (x b ), m is the effective mass of the biomolecule, ( is 
the friction coefficient, and Ax = x ts — x b . 

In the presence of /, the positions of transition state x ts and bound state x b change because 
unbinding kinetics should be determined using F (x) — fx and not F (x) alone. Because x ts 
and x b satisfy the force dependent condition Fq(x) — f = 0, it follows that all the parameters, 
Ax(f), u ts (f), and ou b (f), are intrinsically /-dependent. Depending on the shape of free energy 



potential -Fq^), the degree of 



theoretical studies 



44 



51 



52 



orce-dependence of Ax, u ts , and u b can vary greatly. Previous 



73| have examined some of the consequences of the moving 
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45 



4q in which the free energy profiles 



transition state. In addition, simulational studies 
were explicitly computed from thermodynamic considerations alone clearly showed the change 
of Ax when / is varied. These authors also provided a structural basis for transition state 
movements in the case of unbinding of simple RNA hairpins. The nontrivial coupling of force 
and free energy profile makes it difficult to unambiguously extract free energy profiles from 
experimental data. In order to circumvent some of the problems Schlierf and Rief have used 
EqJT0]to analyze the load-dependent experimental data on unfolding of ddFLN4 and extracted 
an effective one dimensional free energy surface F(x) without making additional assumptions. 
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The results s. 
state region 



rowed that the effective free energy profile is highly anharmonic near the transition 



24 1 



conducted by ramping force over time 



23 



. In this mode the load on the molecule or 



the complex increases with time. When the force increases beyond a threshold value, unbinding 
or bond-rupture occurs. Because of thermal fluctuations the unbinding events are stochastic 
and as a consequence one has to contend with the distribution of unbinding forces. The time- 
dependent nature of the force makes the barrier crossing rate also dependent on t. For a single 
barrier crossing event with a time- dependent rate k(t), the probability of the barrier crossing 
event being observed at time t is P{t) = k(t)S(t) where the survival probability, that the 
molecule remains folded, is given as S(t) = exp ^— J * dTk(r)\ . 

When the molecule or complex is pulled at a constant loading rate (ry) the distribution 
(P(f)) of unfolding forces is asymmetric. The most probable r^-dependent unfolding force (/*) 
is often used to determine the TS location of the underlying energy landscape with the tacit 
assumption that the TS is stationary. When rj = df /dt is constant, the probability of observing 
an unfolding event at force / is written as, 



= -*:(/) exp 



' f f 1 

. Jo Tf 



(12) 



The most probable unfolding force is obtained from dP(f)/df\ 



f=f* 



0, which leads to 



r 



k B T 
Ax(f* 

( 



■{log 



TfAx(f* 



v D {f*)e- pAF ° {n k B T 



+ log 



1 + f 



Ax'(r) 



Ax(f*) 
- tog(e^>,<e-^> { } ) 



k B T 



\ 



V 



Ax(f*) v D (f*)Ax(f*) 



J 



(13) 



where AFq = F (x ts (f)) — F (xo(f)), t denotes differentiation with respect to the argument, 
Ax(f) = x ts (f) — Xo(f) is the distance between the transition state and the native state, and 
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45 



46|] 



uu(f) = uj (f)u ts (f) /27T7- Note that AFq, u d , and Ax depend on the value of / 
Because /* changes with r/, Ax obtained from the data analysis should correspond to a value at 
a certain /*, not a value that is extrapolated to /* = 0. Indeed, the pronounced curvature in the 



12 



plot of /* as a function of logr^ makes it difficult to obtain the characteristics of the underlying 
energy landscape using data from dynamic force-spectroscopy without a reliable theory or a 
model. If AFq, vd, and Ax are relatively insensitive to variations in force, the second term 
on the right-hand side of EqJT3l would vanish, leading to /* oc (ksT/Ax) log 77 [50|. If the 
loading rate, however, spans a wide range so that the force-dependence of AFq, ud, and Ax are 
manifested, then the resulting /* can substantially deviate from the linear dependence to the 
logry. Indeed, it has been shown that for a molecule or a complex known to have a single free 
energy barrier, the average rupture force / ~ (\ogrf) u where the effective exponent v < 1. The 
precise value of v depends on the nature of the underlying potential and is best treated as an 



67|]. 



adjustable parameter 

Provided AFq, ujj, and Ax are assumed constant the applicability of Kramers' equation can 
be checked. In principle, the use of Kramers' equation is justified if the effective energy barrier is 
at least greater than the thermal energy k B T (A* = AFj - f*Ax > k B T) at the most probable 
unfolding force /*. Substitution of /* from Eq{13] leads to 

A* = -A; B Tlog^^. (14) 

The condition that A* > k B T is satisfied as long as r $ < Vd^t^. — r }- For the set of parameters, 
k B T = 4.14 pN -nm, Ax ~ 1 nm, and ujj ~ 10 6 s -1 , the critical loading rate rj ~ 10 6 pN/s. The 
typical loading rate used in force experiments is several orders of magnitude smaller than this 
value. Therefore, it is legitimate to interpret the force experiments using the formalism based 
on the Kramers' equation. If r/ > r% as is typically the case in steered molecular dynamics 
simulations |75j , the forced-unfolding process can no longer be considered a thermally activated 
barrier crossing process. At such high loading rates average (not the same as /*) rupture force 
(/) grows (non-logarithmically) as v 1 ^ 2 where v is the pulling speed 52]. 



MEASUREMENT OF ENERGY LANDSCAPE ROUGHNESS 

In the presence of roughness we expect that the unfolding kinetics deviates substantially from 
an Arrhenius behavior. By either assuming a Gaussian distribution of the roughness contribution 
Fx (P(Fi) oc e~ F i/ 2e2 ) or simply assuming (3 Fx <ti 1 and (Fx) = 0, (F 2 ) = e 2 , one can further 
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simplify EqJTTIto 

\ogk(f)/k = -(AF* - / • Ax)/k B T - e 2 /k 2 B T 2 . (15) 

This relationship suggests that roughness scale e be extracted if log k(f) is measured over range 
of temperatures. Variations in temperature also result in changes in the viscosity, 77, and be- 
cause k^ 1 oc 77, corrections arising from the temperature-dependence of 77 has to be taken into 
account in interpreting the experiments. It is known that 77 for water varies as exp(A/T) 
over the experimentally relevant temperature range (5°C < T < 50°C) 76]. Thus, we expect 
log k(f, T) = a + b/T — e 2 /T 2 . The coefficient of the 1/T 2 term can be quantified by performing 
force-clamp experiments at several values of constant temperatures. In addition, the robustness 
of the HT theory can be confirmed by showing that e 2 is a constant even if the coefficients a 
and b change under different force conditions [41] . The signature of the roughness of the under- 
lying energy landscape is uniquely reflected in the non-Arrhenius temperature dependence of 
the unbinding rates. Although it is most straightforward to extract e using Eqfl5]no roughness 
measurement, to the best of our knowledge, has been performed using force clamp experiments. 

To extract the roughness scale, e, using dynamic force spectroscopy (DFS) in which the force 
increase gradually in time, an alternative but similar strategy as in force clamp experiments 
can be adopted. A series of dynamic force spectroscopy experiments should be performed as 
a function of T and rj so that reliable unfolding force distributions are obtained. Since a 
straightforward application of Eq{13] is difficult due to the force-dependence of the variables in 
EqJTBI one should simplify the expression by assuming that the parameters Ax(f), AF$(f), 
and U£)(f), depend only weakly on /. If this is the case then the second term of EqJTBJcan be 
neglected and EqJTBI becomes 

/ „ _ l 0gry + _ l 0g „ r + 3^. (16) 



One way of obtaining the roughness scale from experimental data is as follows 42j. From the 
/* vs log 77 curves at two different temperatures, T\ and T 2 , one can obtain r^{T\) and r/(T 2 ) 
for which the /* values are identical. By equating the right-hand side of the expression in EqJTBI 
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at T\ and T 2 the scale e can be estimated 4l|, |42j as 
2 Ax(T x )k B T x x Ax(T 2 )k B T 2 



€' 



&x{T x )k B T x - Ax(T 2 )k B T 2 
A pt LJ\ + ^ W r/CrQAsCT!) _ fcBT 2 r / (T 2 )Ax(T 2 ) 



Ax(Tx) Ax(T 2 )7 Ax(TO b v D {T x )k B T x Ax(T 2 ) * u D (T 2 )k B T 2 
In a recent study Nevo et. al. 



.(17) 



used DFS to measure e for a biomolecular protein 
complex consisting of nuclear import receptor importin-/? (imp-/?) and the Ras-like GTPase 
Ran that is loaded with non-hydrolyzable GTP analogue (Figj3]-C). The Ran-imp-/3 complex 
was immobilized on a surface and the unbinding forces were measured using the AFM at three 
values of r/ that varied by nearly three orders of magnitude. At high values of rj the values of 
/* increases as T increases. At lower loading rates (ry < 2 x 10 3 pN/s), however, /* decreases 
as T increases (see Figj3]-B). The data over distinct temperatures were used to extract, for the 
first time, an estimate of e. The values of /* at three temperatures (7, 20, 32°C) and EgflTl 
were used to obtain e ~ 5 — Qk B T. Nevo et. al. explicitly showed that the value of e was 
nearly the same from the nine pairs of data extracted from the /* vs logrj curves. Interestingly, 
the estimated value of e is about 0.2AFJ where AFq is the major barrier for unbinding of 
the complex. This shows that, for this complex the free energy in terms of a one-dimensional 
coordinate, resembles the profile shown in FigfSJ It is worth remarking that the location of the 
transition state decreases from 0.44 nm at 7°C to 0.21 nm at 32°C. The extracted TS movement 
using the roughness model is consistent with Hammond behavior (see below). 



EXTRACTING TS LOCATION (Ax) AND UNFOLDING RATE (k) USING THEORY 

OF DFS 

The theory of DFS, /* ps ^ log 77 + ^ log -^f, is used to identify the forces that desta- 
bilize the bound state of the complex or the folded state of a specific biomolecule. A linear 
regression provides the characteristic extension Ax at which the molecule or complex ruptures 
(more precisely Ax is the thermally averaged distance between the bound and the transition 
state along the direction of the applied force). It is tempting to obtain the zero force unfolding 
rate k from the intercept with the abscissa. Substantial errors can, however, arise in the ex- 
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trapolated values of Ax and k to the zero force if /* vs logry is not linear, as is often the case 
when rf is varied over several orders of magnitude. Nonlinearity of [/*, logr^] curve arises for 
two reasons. One is due to the complicated molecular response to the external load that results 
in dramatic variations in Ax. Like other soft matter, the extent of response (or their elasticity) 



depends on r/ 



1Q. 



The other is due to multiple energy barriers that are encountered in 



the unfolding or unbinding process 



3l|- 



If the TS ensemble is broadly distributed along the reaction coordinate then the molecule can 
adopt diverse structures along the energy barrier depending on the magnitude of the external 
load. Therefore, mechanical force should grasp the signature of the spectrum of the TS confor- 
mations for such a molecule. Mechanical unzipping dynamics of RNA hairpins whose stability 
is determined in terms of the number of intact base pairs is a good example. The conformation 
of RNA hairpins at the barrier top can gradually vary from an almost fully intact structure 
at small forces to an extended structure at large forces. Under these conditions the width of 
the TSE is large. The signature of diverse TS conformations manifests itself as a substantial 
curvature over the broad variations of forces or loading rates. Meanwhile, if the unfolding is 
a highly cooperative all-or-none process characterized by a narrow distribution of the TS, the 
nature of the TS may not change significantly. 

The linear theory of DFS is not reliable if the TSE is plastic because it involves drastic 
approximations of the EqJT3l From this perspective it is more prudent to fit the the experimental 
unbinding force distributions directly using analytical expressions derived from suitable models 



671 ] (see also APPENDIX). If such a procedure can be reliably implemented then the extracted 



parameters are likely to be more accurate. Solving such an inverse problem does require assuming 
a reduced dimensional representation of the underlying energy landscape which cannot be a 
priori justified. 



DIRECT ANALYSIS OF THE UNBINDING FORCE DISTRIBUTION 

Although the direct fit of the measured unbinding force distributions to the 
P(f)\= ^k(f) exp(— J* df'^jk(f'))j is numerically more complicated than using EqUB], it 
avoids a potentially serious error due to the approximation in going from Eq{13] to EqJT6l It 
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is, however, difficult to unearth the energy landscape characteristics of the molecule from P(f) 
alone because P(f) cannot be expressed in a simple functional form. By approximating P(f) 
by a Gaussian we can dissect the effect of molecular topology and sequence on P(f). Note that 
in general the measured P(f) is asymmetric so that the Gaussian approximation may be of 
limited utility. For purposes of illustrating the nuances in the data analysis, we approximate 
the measured P(f) by a Gaussian using Taylor expansion, 



P(f) 



exp 
exp 

exp 



' d/ M/>_ log *(/) 



/v 



, l)-/3/Ax-log- 

Tf pAX Tf 
_ rf-nk B T/Ax ^2 



r f k B T 



exp 



(f-ff 
2a) 



where k(f) = Ke^^ x and k = k e~ f3AF oe~ f3 ' 2e2 . In going from the first to the second line we 
assume that Ax and k are independent of /. Two conclusions can be drawn from Eqfl8l (i) Both 
/* ~ /(= Tf/n — kBT/Ax) and the width of the force distribution cr/(= ( ^^J ) 1 / 2 ) increase 
as Tf increases, (ii) As T increases, /* decreases while Of increases, provided Ax and k are 
independent function ofT orrf. 

Recently, Schlierf and Rief (SR) analyzed the unfolding force distributions (with r/ fixed) of 
a single domain of Dictyostelium discoideum filamin (ddFLN4) at five different temperatures to 



infer the underlying one dimensional free energy surface [43|. Going from the measured data 
(P(f)) to the underlying energy landscape is an inverse problem that requires a specific model. 
Using the Bell model (k(f) = nexp (fAx/ksT)) and an approximation that the Arrhenius pre- 
factor k is a constant (k = A^exp (— AF^/Zc^T) where k = 10 7 s _1 ) that is independent of 
temperature (i.e., assuming zero roughness), Schlierf and Rief extracted the energy landscape 
parameters (Ax, AF*) for each force distribution. Somewhat surprisingly, the SR analysis 
indicated that the rate (with relatively large errors) for populating an intermediate at zero force 
(k) increases as T is lowered. Their fits indicate that the location of the TS must increase as T 
increases. Therefore, they concluded that ddFLN4 protein exhibits an anti-Hammond behavior, 
i.e., the position of the TS moves towards the force-stabilized ensemble of states (unfolded 
states). 
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Visual inspection of P(f) at different temperatures in Figure 2 of ref. 43( by SR confirms that 



/* decreases but o~f decreases as T increases. This means that k x Ax must increase faster than 
the increase of ksT (Note that Of = ( jj^^ j )• Although the data were fit well by assuming 
that AF^ is temperature dependent AF^ while k$ is a constant, an alternative explanation was 
also possible as recognized by SR. In the alternate model based on energy landscape roughness 



we assume that AF* is constant but k depends on temperature [4l|. The constancy of AF* 
is a posteriori justified in light of the SR analysis. By adopting the the HT roughness model 
SR showed the data can be fit using e = 4fcgT for ddFLN4 unfolding. We believe that SR's 
main interpretation (temperature softening effect, or anti-Hammond behavior) still holds even 
within the framework of the roughness model because their conclusion is entirely based on the 
temperature dependence of Ax. The roughness model has the advantage that the data can be 
fit essentially with only one parameter. Moreover, experiments on protein folding have often 
been interpreted using changes in the prefactor due to e. 

To further confirm SR's conclusion we suggest an independent measurement of the precise 
values of Ax by varying not only the temperature but also the loading rate, as is demonstrated 



by Nevo et. al. (Figj3]-B) 42]. The slopes of FigfS] at different temperatures succinctly show 
that imp-/3-RanGppNHp complex exhibit the more common "Hammond" -behavior. Instead 
of using Eq{T7J the energy landscape roughness e can be extracted from the multiple sets of 
P(f) at various (T,rf) conditions. Non-Arrhenius behavior characterized particularly by 1/T 2 
is the unique feature of the roughness scale of the energy landscape, and is not affected by TS 



movements. More explicitly, P(f) should be modeled as 



= K(T] rf) e fAx{T;r f )/k B T 



_ j f df' K ( T i r /) c .fAx(T;r f )/k B T 



(19) 



with k(T\ rf) = V£)(T)e~ AFt / kBT e~ t2 / k B T2 . The overall shape of P(f) is determined by Ax(T; ry) 
and k(T; rf). The multiple force distributions, generated at the same loading rate but at different 
temperatures, can be used to fit k(T;tj) using 

log k{T) =a + b/T- e 2 /T 2 . (20) 

Such experiments would permit unambiguous extraction of roughness just as in the conceptually 
simpler force-clamp experiments (compare with Eq JT5|) . 
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The SR results illustrate the potential difficulties in uniquely extracting model-free energy 
landscape parameters from dynamic force spectroscopy data. It is worth mentioning that Scalley 
and Baker have used similar arguments about the anomalous temperature dependence of refolding 
rates of proteins [ttJ. They showed that the Arrhenius behavior of kinetics is retrieved when 
the protein stability is corrected for the temperature dependence. 



MECHANICAL RESPONSE OF HARD (BRITTLE) VERSUS SOFT (PLASTIC) 

BIOMOLECULES 

Regardless of the model used, it is obvious that the lifetimes of a complex decrease upon 
application of force. The compliance of the molecule is determined by the location of the TS, 
and hence it is important to understand the characteristics of the molecule that determine the 
TS. As we pointed out, many relevant paramters have strong dependence on /, r/, or T. Thus, it 
is difficult to extract the energy landscape parameters without a suitable model. In this section 

of a biomolecule using 
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we illustrate two extreme cases of mechanical response 
one-dimensional energy profiles. In one example the location of the TS does not move with force 
whereas in the other there is a dramatic movement of the TS. In the presence of force /, a given 
free energy profile Fq(x) changes to F(x) = Fq(x) — fx. The location of the TS at non-zero 
values of / depends on the shape of barrier in the vicinity of the TS. Near the barrier (x ~ Xt s ) 
we can approximate F (x) as 

F (x) « F (x ts ) - ^{x ts )(x - x ts ) 2 + ■■■ . (21) 

In the presence of force the TS location becomes Xt s (f) = Xts — F"{x t ) • ^ ^ ne transition barrier 
in F (x) is sharp {x ts FQ(x ts ) ^> f) then we expect very little force- induced movement in the 
TS. We refer to molecules that satisfy this criterion as hard or brittle. In the opposite limit 
the molecule is expected to be soft or plastic so that there can be dramatic movements in the 
TS. We illustrate these two cases by numerically computing r/-dependent P(f) using EqflQland 
EqH2]for two model free energy profiles. 
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Hard response: A nearly stationary TS position (independent of /) is realized if the energy 
barrier is sharp (Eqj2Tl). We model F (x) using 

F (x) = -V \(x + l) 2 - £ 2 | with x > (22) 

where Vq = 28 pN/nm and f = 4 nm. The energy barrier forms at x = 1 nm and this position 
does not change much even in the presence of force as illustrated in FigJU-A. In dynamic force 
spectroscopy the free energy profiles drawn at constant force may be viewed as snapshots at 
different times. The shape of the unbinding force distribution depends on rf. We calculated 
P(f) numerically using Eq{10] and Eq{13] (see FigfJJ-B). Interestingly, a plot of the the most 
probable force /* obtained from P(f) does not exhibit any curvature when 77 is varied over 
six orders of magnitude (FigJU-C). Over the range of 77 the [/*,logr/] plot is almost linear. 
The slight deviation from linearity is due to the force- dependent curvature near the bound state 
(u)b(f)). From the slope we find that Ax ~ 1 nm which is expected from EqfS^J In addition, we 
obtained from the intercept in FigfJJ-C that k = 1.58 s -1 . The value of k[= k(f = 0)] directly 
computed using EqJTU] is k = 1.49 The two values agree quite well. Thus, for brittle 

response the Bell model is expected to be accurate. 

Soft response: If the position of the TS sensitively moves with force the biomolecule or the 
complex is soft or plastic. To illustrate the behavior of soft molecules we model the free energy 
potential in the absence of force using 

F (x) = -V exp (-fx) with x > (23) 

where Vq = 82.8 pN ■ nm and f = 4 (nm) -1 . The numerically computed P(f) and [/*,logr/] 
plots are shown Figj5j The slope of the [/*,logr/] plot is no longer constant but increases 
continuously as 77 increases. The extrapolated value of k to zero / varies greatly depending on 
the range of 77 used. Even in the experimentally accessible range of 77 there is curvature in the 
[/*, logr/] plot. Thus, unlike the parameters (Ax, k) in the example of a brittle potential, all 
the extracted parameters from the force profile are strongly dependent on the loading rate. As 
a result, in soft molecules the extrapolation to zero force (or minimum loading rate) is not as 
meaningful as in hard molecules. Note how the extracted Ax (see the inset of Figj5]-A) changes 
as a function of rf. For soft (plastic) molecules, the extracted parameters using the tangent at 
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a certain r'j are not the characteristics of the free energy profile in the absence of the load, but 
reflect the features for the modified free energy profile tilted by (/*)° at r°j. 

In practice, biomolecular systems lie between the two extreme cases (brittle and plastic). In 
many cases the [/*, log rf\ appears to be linear over a narrow range of r/. The linearity in narrow 
range of logry, however, does not guarantee the linearity under broad variations of loading rates. 
In order to obtain energy landscape parameters it is important to perform experiments at Tf 
as low as possible. The brittle nature of proteins (lack of change in Ax) inferred from AFM 
experiments may be the result of a relatively large ry (« 1, OOOpN/s). On the other hand, only 
by varying rj over a wide range the molecular elasticity of proteins and RNA can be completely 



described. Indeed, we showed that even in simp 



e RNA hairpins the transition from plastic to 



brittle behavior can be achieved by varying rj 44]. The load-dependent response may even have 
functional significance. 



HAMMOND/ANTI-HAMMOND BEHAVIOR UNDER FORCE AND 
TEMPERATURE VARIATIONS 

The qualitative nature of the TS movement with increasing perturbations can often be an- 
ticipated using the Hammond postulate which has been successful in not only analyzing a large 
class of chemical reactions but also in rationalizing the observed behavior in protein and RNA 
foldings. The Hammond postulate states that the nature of TS resembles the least stable species 
along the reaction pathway. In the context of forced-unfolding it implies that the TS location 
should move closer to the native state as / increases. In other words Ax should decrease as 
/ is increased. Originally Hammond postulate was introduced to explain chemical reactions 



involving small organic molecules 
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791 ] . Its validity in biomolecular folding is not obvious 
because there are multiple folding or unfolding pathways. As a result there is a large entropic 
component to the folding reaction. Surprisingly, many folding processes are apparently in accord 



with the Hammond postulate 
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If the extension is an appropriate reaction coordinate 
for forced unfolding then deviations from Hammond postulate should be an exception than the 
rule. Indeed, anti-hammond behavior (movement of the TS closer to more stable unfolded state 
as T increases) was suggested by SR based on a model used to analyze the AFM data. The 
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simple free energy profiles used in the previous section (EqfjS] and Eq j2"51) can be used to verify 
the Hammond postulate when the external perturbation is either force or temperature. First, 
for the case of hard response the TS is barely affected by force, thus the Hammond or anti- 
Hammond behavior is not a relevant issue when unbinding is induced by /. On the other hand, 
for the case of soft molecules Ax always decreases with a larger force. The positive curvature 
in [f*,logrf] plot is the signature of the classical Hammond-behavior with respect to /. 

As long as a one dimensional free energy profile suffices in describing forced-unfolding of 
proteins and RNA the TS location must satisfy Hammond postulate. In general, for a fixed 
force or r/, Ax can vary with T. The changes in Ax with temperature can be modeled using 
T-dependent parameters in the potential. To evaluate the consequence of T-variations we set 

f = £o + a(T - 300K) (24) 

for both free energies in Eqj22] and Eqj23l Depending on the value of a the position of the TS 
can move towards or away from the native state. We set a = ±0.1 for both the hard and soft 
cases. The numerically computed [/*,logr/] plots are shown in FigJHl One interesting point is 
found in soft molecule that exhibits Hammond behavior. For wide range of Tf, Ax decreases as 
T increases. However, the most probable unbinding force /* at low temperatures can be larger 
or smaller than /* at high temperatures depending on the loading rate (see upper-right corner 
of Figj6]). A very similar behavior has been observed in the forced-unbinding of Ran-imp-/3 



complex 42| (see also Figj3]-B). Although the model free energy profiles can produce a wide 
range of behavior depending on T, /, and r/ the challenge is to provide a structural basis for 
the measurements on biomolecules. 



MULTIDIMENSIONALITY OF ENERGY LANDSCAPE COUPLED TO "MEMORY" 

AFFECT THE FORCE DYNAMICS 

The natural one dimensional reaction coordinate in mechanical unfolding experiments is the 
extension x of the molecule. However, local rupture events can couple to x, which would require 
a multidimensional description. For example, consider the case of forced-unfolding of a nucleic 
acid hairpin. The opening of a given base pair is dependent not only on its strength, which 
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resists unfolding, but also on the increase in the available conformational space which favors 
unfolding. In this case fluctuations in the collective coordinates, which describe the local events, 
are coupled to the global coordinate x. Thus, suitable fluctuations in the local coordinate r has to 
occur before an increase in the extension is observed. Such a coupling between local coordinates 



and global observable arises naturally in many physical situations [83(. In the context of ligand 
binding to myoglobin Zwanzig first proposed such a model by assuming that reaction (binding) 
takes place along ^-coordinate and at the barrier top (x = x^) the reactivity is determined by 



the cross section of bottleneck described by r-coordinate [84|. We adopt a similar picture to 
describe the modifications when such a process is driven by force. First, consider the Zwanzig 
case i.e, r/ = 0. The equations of motion for x and r are, respectively, 

d 2 x Ax dU(x) _, , . 

J = -1-r + F r (t). (25) 

The Liouville theorem (-£ = 0) describes the time evolution of probability density, p(x,r,t), as 

dp dp d ( dx \ d ( dr \ „ 

i = i + + a (26) 

By inserting of Eqj25]to Eql26]and neglecting the inertial term, (mCf ), and averaging over the 
white-noise spectrum, and the fluctuation-dissipation theorem ((F x (t)F x (t')) = 2(k B T5(t — t'), 
(F r (t)F r (t')) = 2X6 5 {t — t') where (r 2 ) = 6) leads to a Smoluchowski equation for p(x,r,t) in 
the presence of a reaction sink. 

0~p 

— = C x p + C r p - k r r 2 5(x - x ts )p (27) 

where C x = D£ (± + and C r = Xd£ (f + |). Integrating both sides of Eq(27] 

using f dxp(x,r,t) = C(r,t) leads to 

Qn 

— = £ r C(r, t) - k r r 2 p(x ts , r, t). (28) 

By writing p(x ts ,r,t) = (p x (x ts )C(r,t) where <t>(x ts ) should be constant as (p(x ts ) = 
e -u( Xts )/k B T i j dxe -u{x)/k B T w J^M e -(u(xu)-u(x b ))/k B T ) EqEHlbecomes 

f)T< _ 

— = C r C(r,t)-kr 2 C(r,t). (29) 
at 
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where k = k r J e ( f/ ( :Cts ) u(x b ))/k B T^ j n a jj likelihood A; r reflects the dynamics near the 



barrier, so we can write = K^^-e AU / k B T where k describes the geometrical information 

' 27T7 ° 

of the cross section of bottleneck. Now we have retrieved the equation in Zwanzig's seminal 
paper where the survival probability (£(£) = drC(r, t)) is given under a reflecting boundary 



condition at r = and Gaussian initial condition C(r,t = 0) ~ e r ' . By setting C(r,t) = 
exp (v(t) — /z(i)r 2 ), Eqj29l can be solved exactly, leading to 

u'(t) = -2X9^(t) + \ 

^(t) = -A\9fx 2 (t) + 2\fx(t) + k. (30) 

The solution for fi(t) is obtained by solving ^ a z_f^ d z a2 = t, and this leads to 

fi(t) l( +s (S+l)-(S-l)E 



(i(p) 2 [ (S + 1) + (S-1)E 

M,n ^ i f(S + l) + (S-l)EY 1/2 
u(t) = --(S - 1) + log l± } —j (31) 

with /i(0) = 1/28. The survival probability, which was derived by Zwanzig, is 

where S = (l + ^ff' 2 and E = e~ 2XSt . 

We wish to examine the consequences of coupling between local and global reaction coordi- 
nates under tension. In order to accomplish our goal we solve the Smoluchoski equation in the 
presence of constant load. In this case, k in EqE! should be replaced with ke t( - r f Ax/kBT) . EqJM, 
however, becomes hard to solve if the sink term depends on t. Nevertheless, analytical solutions 
can be obtained for special cases of A. If A — > oo, dH(t)/dt = — A;^e i( - T ' /A:r ' /fcsT ^S(t), and hence, 

' ' (33) 



£(/) = exp (-k9 df^-e^^ = exp ^-^^( e ^/ fe - T - 1)) 



Using the rupture force distribution P(f) = — and dP ^P | /=/* = 0, one can obtain the most 
probable force 
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If A is small (A -> 0) then C(r,t) = exp [- /* dtkr 2 exp (t x r f A/k B T)} 



exp 



.krkBT,trfAx/k B T _ ft ith th ^i ti al distribution of e" r2 / 2e . Thus, 



£(/) 



y ' [ r f Ax 




2 exp [-r 2 /2fl] 




± e ^ {el ^r_ 1)+ i_y>\ (3B) 

Note that if r f -»• we recover Zwanzig's result £(t) ~ (1 + 2fc0t)~ 1 / 2 . Using P(/) = -dE(f)/df 

- 3/2 fc 



pm = 1 f kkBT ( e f^/ kB T + 

U) V2^\r f Ax [e i) + 26) 

dP{f)/df\ f=f * = gives 



. e /A*A fl T (36) 



^ , f Z' r / As \ ( , 2k9k B T 



^t'^fej^wil' (37) 

in which > (1 + 26 1 ) ke ^ T since /* > 0. This shows that /* vs rj has a different form when 
A — > from the one when A — ► oo. The deviation of EqfST] from the conventional relation is 
pronounced when [77 - (1 + 28)^f] -> + . 

At present, experimental data have been interpreted using only one dimensional free energy 
profiles. The meaning and the validity of the extracted free energy profiles has not been estab- 
lished. At the least, this would require computing the force-dependent first passage times using 
the "experimental" free energy profile assuming that the extension is the only slowly relaxing 
variable. If the computed force- dependent rates (inversely proportional to first passage times) 
agree with the measured rates then the use of extension of the reaction coordinate would be 
justified. In the absence of good agreement with experiments other models, such as the one 
we have proposed here, must be considered. In the context of force-quench refolding we have 
shown (see below) that extension alone is not an adequate reaction coordinate. For refolding 
upon force-quench of RNA hairpins, the coupling between extension and local dihedral angles, 
which reports on the conformation of the RNA, needs to be taken into account to quantitatively 
describe the refolding rates. 
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CONCLUSIONS 

With the advent of single molecule experiments that can manipulate biomolecules using 
mechanical force it has become possible to get a detailed picture of their energy landscapes. 
Mechanical folding and unfolding trajectories of proteins and RNA show that there is great 



diversity in the explored routes 
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. In certain well defined systems with simple native 



states, such as RNA and DNA hairpins, it has been shown using constant force unfolding that 



the hairpins undergo sharp bistable transitions from folded to unfolded states 2J, |25|, |38( . From 
the dynamics of the extension as a function of time measured over a long period the underlying 
force dependent profiles have been inferred. The force-dependent folding and unfolding rates 
and the unfolding trajectories can be used to construct the one-dimensional energy landscape. 
In a remarkable paper 25j , Block and coworkers have shown that the location of the TS can be 
moved, at will, by varying the hairpin sequence. The TS was obtained using the Bell model by 
assuming that the Ax is independent of /. While this seems reasonable given the sharpness of 
the inferred free energy profiles near the barrier top it will be necessary to show the Ax does 
not depend on force. 

The fundamental assumption in inverting the force-clamp data is that the molecular extension 
is a suitable reaction coordinate. This may indeed be the case for force-spectroscopy in which 
the response of the molecule only depends on force that is coupled to the molecular extension, 
which may well represent the slow degrees of freedom. The approximation is more reasonable 
for forced-unbinding. It is less clear if it can be assumed that extension x is the appropriate 
reaction coordinate when refolding is initiated by quenching the force to low enough values 
such that the folded state is preferentially populated. In this case the dynamic reduction in 
x can be coupled to collective internal degrees of freedom. In a recent paper [44j we showed, 
in the context of force-quench refolding of a RNA hairpin, that the reduction in x is largely 
determined by local conformational changes in the dihedral angle degrees in the loop region. 
Zipping by nucleation of the hairpin with concomitant reduction in x does not occur until the 
transitions from trans to gauche state in a few of the loop dihedral angles take place. In this 
case, one has to consider at least a two dimensional free energy landscape. FigJT] clearly shows 
such a coupling between end-to-end distance (R) and the dihedral angle degrees of freedom. 
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The " correctness" of the six dihedral angles representing the conformation of RNA hairpin loop 
region i — 19, . . . 24) is quantified using (1 — cos (0j — <f>°)), where 4>° is the angle value in the 
native state and (. . .) is the average over the six dihedral angles. (1 — cos {(f) — <f)°)) = signifies 
the correct dihedral conformation for the hairpin loop region. Once the "correct" conformation 
is attained in the loop region, the rest of the zipping process can easily proceed as we have 



shown in 



441 ]. Before the correct loop conformation being attained, RNA spends substantial 



time in searching the conformational space related to the dihedral angle degree of freedom. The 
energy landscape ruggedness is manifested as in FigJ7]when conformational space is represented 
using multidimensional order parameters. The proposed coupling between the local dihedral 
angle degrees of freedom and extension (global parameter) is fairly general. A similar structural 
slowing down, due to the cooperative link between local and global coordinates, should be 
observed in force-quench refolding of proteins as well. 

The most exciting use of singe molecule experiments is in their ability to extract precise 
values of the energy landscape roughness e by using temperature as a variable in addition to /. 
In this case a straightforward measurement of the unbinding rates as a function of / or rf can 
be used to obtain e without having to make any assumptions about the underlying mechanisms 
of unbinding. Of course, this involves performing a number of experiments. In doing so one can 
also be rewarded with diagram of states in terms of / and T 45j. The theoretical calculations 
and arguments given here also show that the power of single molecule experiments can be 
fully realized only by using the data in conjunction with carefully designed theoretical and 
computational models. The latter can provide the structures that are sampled in the process of 
forced-unfolding and force-quench refolding as was illustrated for ribozymes and GFP It is 
likely that the promise of measuring the energy landscapes of biomolecules, almost one molecule 
at a time, will be fully realized using a combination of single molecule measurements, theory, 
and simulations. Recent studies have already given us a glimpse of that promise with more to 
come shortly. 
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APPENDIX: DYNAMIC FORCE SPECTROSCOPY IN A CUBIC POTENTIAL 

Assuming that the Bell model gives a correct description of forced-unbinding it was shown 
by Evans and Ritchie that the most probable unbinding force /* ~ log r y assuming that 
the TS location is independent of rf |50j. Deviations from logarithmic dependence of /* on re- 
occurs if the assumptions of the Bell model is relaxed as was first shown by Dudko et. al. [511 ] . 
In this Appendix we give a simple derivation of the result for the cubic potential, the simplest 
one that has a potential well (bound state) and a barrier for which we can analytically formulate 
the dynamic force spectroscopy theory. The general form of the cubic potential is, 

F (x) = ax 3 + bx 2 + cx + d with a < 0. (38) 

In the presence of constant force /, F(x) = Fq(x) — fx can have a finite free energy barrier if 
the two roots of F'(x) = 3ax 2 + 2bx + c — / = 0, namely, 



-2b ± ^Ab 2 - Uajc - /) 
± 6a 

are real. Provided b 2 — 3a(c — /) > 0, r_ and r + correspond to x ts and Xb, respectively. Note that 
both x ts and Xb are functions of / in the cubic potential and so is Ax = x ts —Xb = — ^ . 

The distance between x ts and Xb decreases as / grows and vanishes when the force reaches a 
critical force f c = c— where the free energy barrier also becomes zero. The /-dependent free 
energy barrier(AF^) is calculated by AF*(f) = F(xt s ) — F( 

AF t (f) = a(ri - rj.) + &(r^ - r\) + c(r_ - r+) - /(r_ - r+) 

= U c e 3 / 2 . (39) 

where e = 1 — f / f c . The curvatures at the barrier (x = x ts ) and the bottom (x = xj,) of the 
potential, that are used in the Kramers' rate expression, are calculated using F"(x)\ x=XtgtXb = 
(6ax + 2b)\ x=Xu , Xb = mu) 2 tsh . 

-\2af c 1-4- 



fc 

fi c e 1/4 (40) 
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Note that (i) Ax = muj 2 j{— 3a) is satisfied for all < / < f c , and (ii) u ts = u b , the point 
symmetry around the inflection point x c that satisfies F'(x c ) = F"(x c ) = 0, are the unique 
properties of the cubic function for any parameter set (a, b, c, d, f). 

This property is no longer valid if the free energy function is modeled using a higher order 
polynomial or special functions. What is worse for the higher order polynomial is that roots of 
F'(x) = are not easily found, and that if order is higher than 6 there is no general solution. 
Thus, if higher polynomials are needed to fit the free energy profile, even with the assumption 
that the extension is an appropriate reaction coordinate, then analytically tractable solutions 
are not possible. It may be argued that any smooth potential may be locally approximated 
using a cubic potential, and hence can be used to analyze the experimental data. 

Using EqsIIH H21 M, and M we get 

, , nit 1 ' 2 



k 



27T7 



exp (-U c e 3/2 /k B T) 



(41) 



and 



Pie) 



exp (-U c e 3/2 / k B T) exp ' " c 



Tf 27T7 

Afe 1/2 exp 



dee l/2 e -U c e^/k B T 



u c t 3 ' 2 k B Tf c n 



c -U c e 3 / 2 /k B T 



(42) 



where M 



k B T 37rU c jr f 

exp ( ^J/^ e~ Uc l kBT ) . The most probable force /* is obtained using 



2-ir-yr j " \ 3irU c 'yrf 



dP(f)/df\f=f* = 0, which establishes the relation with respect to e*(= 1 — /*/ f c ) as 



U, 



' / e *\3/2 
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log 
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r f 



(43) 



If /* < f c which also satisfies the condition Stf > (e*) 2/3 , 



k B T 



r «/c 



k B T 



log 
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WJ c k B T 



x r f 
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(44) 



f 



By defining r 
the above expression as 



k B T>y 



e" c/ "' ii " as the minimum loading rate that gives /* > 0, one can rewrite 



r^fc 



k B T Tf 
log — — 



2/3' 



(45) 
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The importance of the result in Eql4"5l was first emphasized by Dudko et. al. 

Two comments about the announced deviation from the usual logarithmic dependence of /* 
on Tf are worth making. (1) Provided log -C 1, Eq|45] becomes 



which is the typical /* vs \ogrf relation. It is of particular interest to see if the condition 



be really distinguished using experimental data in which r/ cannot be easily varied by more 
than three (at best) orders of magnitude. Thus, the reliability of the parameters obtained using 
either formulation is difficult to assess independently. Nevertheless, the reexamination of the 
logarithmic dependence of /* on rj shows that one should be mindful of the assumption that 
the location of the TS does not depend on f or 77. 




(46) 



~fjT~ log <C 1 is satisfied in proteins or RNA. (2) Because /* ~ logrf U in both the 
Evans-Ritchie [y = 1) [50] and Dudko {y = |) formulations [67] it is unclear whether they can 
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Figure Caption 

Figure [TJ : Single molecule experiments using mechanical force. On the left we show 
a schematic setup of LOT experiments in which a ribozyme is mechanically unfolded. The 
Tetrahymena ribozyme is sandwiched between two RNA/DNA hybrid handles that are linked 
to micron-sized beads. The structures of a few of the displayed intermediates were obtained us- 



ing simulations of the Self-Organized Polymer (SOP) model representation [85( of the ribozyme. 
The sketch on the right shows a typical AFM setup in which we illustrate unfolding of GFP. 
The intermediate structures, that are shown, were obtained using simulations of the SOP model 
for GFP. 

Figure : Caricature of the rough energy landscape of proteins and RNA that fold in an 
apparent "two-state" manner using extension x, the coordinate that is conjugate to force /. 
Under force /, the zero-force free energy profile (Fq(x)) is tilted by / x x and gives rise to the 
free energy profile, F(x). In order to clarify the derivation of EqJH]we have explicitly indicated 
the average location of the relevant parameters. 

Figure [3] : Dynamic force spectroscopy measurements of single imp-/3-RanGppNHp pairs 
at different temperatures. A. Distributions of measured unbinding forces using AFM for the 
lower-strength conformation of the complex at different loading rates at 7 and 32°C. Roughness 
acts to increase the separation between the distributions recorded at different temperatures. 
The histograms are fit using Gaussian distributions. The width of the bins represents the 
thermal noise of the cantilever. B. Force spectra used in the analysis. The most probable 
unbinding forces /* are plotted as a function of log(rf). The maximal error is ±10% because 
of uncertainities in determining the spring constant of the cantilevers. Statistical significance of 
the differences between the slopes of the spectra was confirmed using covariance test. (Images 
courtesy of Reinat Nevo and Ziv Reich 42j). C. Ran-importin/3 complex crystal structures 
(PDB id: 1IBR 86]) in surface (left) and ribbon (right) representations. In AFM experiments, 
Ran (red) protein complexed to importin/3 (yellow) is pulled until the dissociation of the complex 
takes place. 

Figure [4] : Dynamic force spectroscopy analysis using a model free energy profile Fq(x) = 
— Vq| (a; + l) 2 — £ 2 | with Vq = 20pN/nm, £ = 4nm, and x > 0. The lack of change in xt s as / 
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changes shows a hard response under tension. A. Effective free energy profile (F(x)) at various 
values of /. B. Distributions of unbinding forces at different loading rates. C. Plot of the most 
probable unbinding force (/*) versus logr/. 

Figure [5] : Dynamic force spectroscopy for soft response to / using the / = free energy 
profile Fq(x) = — Voexp(— £x) with Vq = 82.8pN ■ nm, £ = 4(nm) -1 . A. Effective free energy 
profile (F(x)) as a function of /. For emphasis on the soft response of the potential, the position 
of TS at each force value is indicated with arrows. B. Distributions of unbinding forces at varying 
loading rates. C. Plot of most probable unbinding force (/*) versus logr/. The slope of the 
tangent at each loading rate value varies substantially, which suggests the variation in the TS 
(inset) as r/ changes. 

Figure [6] : By tuning the value of £ (see Eq. (23)) as a function of temperature, Hammond 
and anti-Hammond behaviors emerge in the context of force spectra in the free energy profiles 
that show hard and soft responses. The condition for Hammond or anti-Hammond behavior 
depends on a (Eq. (23)). 

Figure [7] : A. A sample refolding trajectory of a RNA hairpin starting from the stretched 
state. The hairpin was, at first, mechanically unfolded to a fully stretched state and the force 
was subsequently quenched to zero at t ~ 20 fis. The time-dependence of the end-to-end 
distance shows that force-quench refolding occurs in steps. B. The deviation of the dihedral 
angles from their values in the native state as a function of time shows large departures from 
native values of the dihedral angles in loop region (indicated by the red strip). Note that this 
strip disappears around t ~ 300 fis, which coincides with the formation of bonds shown in C. fs 
is the fraction of bonds in pink that indicates that the bond is fully formed. D. The histograms 
collected from the projections of twelve stretching and force-quench refolding trajectories on the 
two dimensional plane characterized by the end-to-end distance (R) and the average correctness 
of dihedral angles ((1 — cos (0 — 0°))) around the loop region (i — 19 — 24). The scale on the 
right gives the density of points in the two dimensional projection. This panel shows that the 
local dihedral angles are coupled to the end-to-end distance R, and hence extension alone is not 
a good reaction coordinate especially in force-quench refolding. The molecular extension x is 
related to R by x = R — Rn where Rn is the distance in the folded state. 
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